On the Statistical Comparison of Inductive Learning Methods
نویسندگان
چکیده
Experimental comparisons between statistical and machine learning methods appear with increasing frequency in the literature. However, there does not seem to be a consensus on how such a comparison is performed in a methodologically sound way. Especially the eeect of testing multiple hypotheses on the probability of producing a "false alarm" is often ignored. We transfer multiple comparison procedures from the statistical literature to the type of study discussed in this paper. These testing procedures take the number of tests performed into account, thereby controlling the probability of generating "false alarms". The multiple comparison procedures selected are illustrated on well-known regression and classiication data sets. 26.1 Introduction Recent interactions between the statistical and artiicial intelligence communities (see e.g. Han93, CO94]), have led to many studies that compare the performance of empirical statistical and machine learning methods on real-life data sets; examples are the StatLog We observe that there is no consensus in the research community on how such a comparative study is performed in a methodologically sound way. The ranking of k preselected methods is usually performed by training (estimating in statistical terminology) the methods on a single data set, and estimating their respective mean prediction errors (mpe) from a hold-out sample. The methods are subsequently ranked according to their estimated mpes. Some studies use, in our view appropriately, statistical signiicance testing in order to make this ranking. However, the eeect that comparing multiple methods has on the probability of generating a "false alarm" (claiming 1996 Springer-Verlag.
منابع مشابه
Comparison between traditional teaching and small groups teaching methods in teaching physiology respiratory system in medical students: Hormozgan university of medical science
Introduction: Lecture is the most applied method for teaching theory courses to medical students. This method is not accountable for the high current needs of students and leads to boring classes. This study was designed to assess the teaching effectiveness of active learning in small group sessions, compared with passive learning in didactic lectures. Methods: This study was carried out ...
متن کاملInductive vs. Deductive Grammar Instruction and the Grammatical Performance of EFL Learners
Learning a foreign language offers a great challenge to students since it involves learning different skills and subskills. Quite a few number of researches have been done so far on the relationship between gender and learning a foreign language. On the other hand, two major approaches in teaching grammar have been offered by language experts, inductive and deductive. The present study examines...
متن کاملThe effect of using computer games on learning free throw Basketball skill and comparison with skilled and learning pattern
The aim of this experiment was to study the effect of computer game on acquisition and retention of free throw basketball skills (accuracy and pattern) and compare this method with two patterning methods (skilled and learning). 50 high school students age range 15-17 years were selected as a research samples. Subjects after participating in the pretest were divided into 5 equal groups and each ...
متن کاملAn Explanation of the Teacher’s Personal Constructs: The Effects of the English Language Instructors’ Majors on their Orientations toward Teaching Reading Comprehension
This study seeks to examine how teachers’ educational background (those who have educated in TEFL in comparison to those who have educated in Literature) may influence their beliefs and attitudes on language learning, teaching and the teaching methods they apply in their classes. On a general basis, depending on teachers’ opinion, learning process might be seen as behavioristic, inductive, inte...
متن کاملComparative Experiments on Disambiguating Word Senses: An Illustration of the Role of Bias in Machine Learning
This paper describes an experimental comparison of seven different learning algorithms on the problem of learning to disambiguate the meaning of a word from context. The algorithms tested include statistical, neural-network, decision-tree, rule-based, and case-based classification techniques. The specific problem tested involves disambiguating six senses of the word "line" using the words in th...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1995